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Abstract 

u. 

We show that in a knapsack feasibility problem an integral vector p, which is short, and near 
parallel to the constraint vector gives a branching direction with small integer width. 

We use this result to analyze two computationally efficient reformulation techniques on low 
density knapsack problems. Both reformulations have a constraint matrix with columns reduced 
in the sense of Lenstra, Lenstra, and Lovasz. We prove an upper bound on the integer width 
along the last variable, which becomes 1, when the density is sufficiently small. 

In the proof we extract from the transformation matrices a vector which is near parallel to 
^ ' the constraint vector a. The near parallel vector is a good branching direction in the original 

knapsack problem, and this transfers to the last variable in the reformulations. 
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1 Introduction and notation 

Geometry of Numbers and Integer Programming [22] 



Starting with the work of H. W. Lenstra [18] , algorithms based on the geometry of numbers have 
been an essential part of the Integer Programming landscape. Typically, these algorithms reduce 
an IP feasibility problem to a provably small number of smaller dimensional ones, and have strong 
theoretical properties. For instance, the algorithms of [181 [T2| [T9] have polynomial running time 
in fixed dimension; the algorithm of [7] has linear running time in dimension two. One essential 
tool in creating the subproblems is a "thin" branching direction, i.e. a c integral (row-)vector with 
the difference between the maximum and the minimum of ex over the underlying polyhedron being 
provably small. Basis reduction in lattices - in the Lenstra, Lenstra, Lovasz (LLL) [17] . or Korkine 
and Zolotarev (KZ) |13[ [T2] sense - is usually a key ingredient in the search for a thin direction. 
For implementations, and computational results, we refer to [4"t ll0ll2"T] . 

A simple, and experimentally very successful technique for integer programming based on LLL- 
reduction was proposed by Aardal, Hurkens and A. K. Lenstra in [2] for equality constrained IP 
problems; see also pp. Consider the problem 



(IP-EQ) 



where A is an integral matrix with m independent rows, and let 

N(A) = {x £Z n \Ax = 0}. (1.1) 

The full-dimensional reformulation proposed in [2] is 



Ax 
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o< 


X 


<v 


X 


G 


Z n , 



(IP-EQ-N) 



—Xb < V\ < v — xt, 

A G Z n - m . 

Here V and Xb satisfy 

{ VX | A G Z n - m } = N(A), x b G Z", Ax b = b, 

the columns of V are reduced in the LLL-sense, and x& is also short. For several classes of hard 
equality constrained integer programming problems - e.g. [5] - the reformulation turned out to be 
much easier to solve by commercial solvers than the original problem. 



In [H] an experimentally just as effective reformulation method was introduced, which leaves the 

number of the variables the same, and is applicable to inequality or equality constrained problems 

as well. It replaces 

Ax < b , s 

IP 

with 

(AU)y <b /m , 

V Jy ~ (IP-R 

v e^, 

where U is a unimodular matrix that makes the columns of AU reduced in the LLL-, or KZ-sense. 
It applies the same way, even if some of the inequalities in the IP feasibility problem are actually 
equalities. Also, if the constraints are of the form b' < Ax < b in (|IP[) . the reformulation is 
just b' < (AU)y < b, so we do not bring the system into a standard form. In [14j the authors 
also introduced a simplified method to compute a reformulation which is essentially equivalent to 



(IP-EQ-N). 



We call pP-Rp the rangespace reformulation of (lIPj) ; and (IP-EQ-N) the nullspace reformulation 



of (IP-EQ). 



These reformulation methods are very easy to describe (as opposed to say H. W. Lenstra's 
method), but seem difficult to analyze. The only analyses are for knapsack problems, with the 
weight vector having a given "decomposable" structure, i.e. 

a = \p + r (1.2) 

with p, r, and A integral, and A large with respect to ||p||, and ||r||, see [3[ 114] . 

The results in these papers are a first step towards a general analysis. However, besides as- 
suming the decomposable structure a priori, they only prove an upper bound on the width in the 
reformulations along the last variable. 

The goal of this paper is to prove such width results on the knapsack feasibility problem 

(KP) 



01 


< ax < 


ih 





< x < 

xez n , 


V 



where a is a positive, integral row vector, (3%, and /?2 are integers without assuming any structure 
on a. We will assume that a has low density. The density of a set of weights a = (a±, . . . , a n ) is 

n 

fog2 ll a lloo 

Subset sum problems (when /3i = /?2 = /?, and v is the vector of all ones) with the weight vector 
having low density have been extensively studied. The seminal paper of Lagarias and Odlyzko [16] 
proves that the solution of all but at most a fraction of l/2 ra subset sum problems, which have a 



solution, and have density less than c/n can be found in polynomial time, where c ~ 4.8. Clearly 
d(a) < c/n is equivalent to 2 n ' c <||o[| 00 . 

Let 

G n (M) = {aeZ n \a t £{l,...,M}}. (1.4) 

Furst and Kannan in [9] showed that for some c > constant, if M > 2 cnlogn , then for almost all 
a G G n (M) and all j3 the problem (JKP]) has a polynomial size proof of feasibility or infeasibility. 
Their second result shows that for some d > constant, if M > 2 , then for almost all a G G n {M) 
and all (3 the problem (IKPj) can be solved in polynomial time. Their proof works by constructing 
a candidate solution to (jKPp . and showing that for almost all a G G n (M), if there is a feasible 
solution, then it is unique, and the candidate solution must be it. 

If we assume the availability of a lattice oracle, which finds the shortest vector in a lattice, then 
the result of [16J can be strengthened to only requiring the density to be less than 0.6463. The 
current best result on finding the solution of almost all (solvable) subset sum problems using a 
lattice oracle is by Coster et al [6]: they require only d(a) < 0.9408. 

The rangespace reformulation of (JKP]) is 

01 < ally < (3 2 
< Uy < v (KP-R) 

y ez n , 

where U is a unimodular matrix that makes the columns of U reduced in the LLL-sense (we 

w 

do not analyze it with KZ-reduction). The nullspace reformulation is 

-x < VX <v-x p 

X G Z n ~ m , 

where xp G Z n , axp = ft, { VX \ X G Z n ~ m } = N(a), and the columns of V are reduced in the 
LLL-sense. 

We will assume ||o|| > 2^ n ' 2+1 ' n . which is satisfied, when d{a) < 2/(n + 2). We will not assume 
any a priori structure on a. In fact, a key point will be that a decomposable structure is automat- 
ically "discovered" by the reformulations. Precisely, we will prove that in both reformulations a 
decomposition a = Xp + r can be found from the transformation matrices, now with only p integral, 
and that branching on the last variable in the reformulations will be equivalent to branching on px 
in the original problem. 

There are crucial differences between the results that assume a decomposable structure, and 
the results of this paper. For instance, in |14j one needs to assume 

A > 2< n - 1 )/ 2 ||p|| (||r|| +1) 2 , (1.5) 

A > 2("- 1 )/ 2 |H| 2 ||r|| 2 , (1.6) 



for the analysis of the rangespace- and nullspace reformulations, respectively. A decomposition 
with any of these properties is unlikely to exist no matter how large || a || is, so we cannot plug the 
decomposition result of this paper into the argument used in [Hj. We will prove a weaker lower 
bound on A, and an upper bound on || r || /A in Theorems O and|U and we will use these bounds 
in Theorem [5] quite differently from how it is done in [14j . 



Notation Vectors are column vectors, unless said otherwise. The ith. unit row- vector is e«. In 
general, when writing pi, p2, etc, we refer to vectors in a family of vectors. When pi refers to the 
zth component of vector p, we will say this explicitly. For a rational vector b we denote by round(fe) 
the vector obtained by rounding the components of b. 

We will assume < (3\ < 02 < av, and that the gcd of the components of a is 1. 

For a polyhedron Q, and an integral row-vector c, the width, and the integer width of Q along 
c are 

width(c, Q) = max { ex \ x £ Q } — min { ex \ x £ Q }, and 
iwidth(c, Q) = [_ max {cx\x£Q}\ — [min { ex \ x £ Q }] +1. 

The integer width is the number of nodes generated by branch-and-bound when branching on the 
hyperplane ex; in particular, iwidth(ej, Q) is the number of nodes generated when branching on X{. 
If the integer width along any integral vector is zero, then Q has no integral points. Given an integer 
program labeled by (P), and c an integral vector, we also write width(c, (P)), and iwidth(c, (P)) 
for the width, and the integer width of the LP-relaxation of (P) along c, respectively. 

A lattice in M. n is a set of the form 

L = L(B) = {Bx\xeZ n }, (1.7) 

where B is a real matrix with n independent columns, called a basis of L. A square, integral matrix 
U is unimodular if detU = ±1. It is well known that if B\ and B<i are bases of the same lattice, 
then B2 = B\U for some unimodular U. The determinant of L is 

detL = (det B T B) 1/2 , (1.8) 

where B is a basis of L; it is easy to see that detL is well-defined. 

The LLL basis reduction algorithm [T7j computes a reduced basis of a lattice in which the 
columns are "short" and "nearly" orthogonal. It runs in polynomial time for rational lattices. For 
simplicity, we use Schrijver's definition from [23J . Suppose that B has n independent columns, i.e. 

B = [h,..., b n ], (1.9) 

and &*,...,&* form the Gram-Schmidt orthogonalization of b\, . . . , b n , that is b\ = b\, and 

h = b* + Y, KM with tHj = bfb*/ || b* || 2 (t = 2, . . . , n; j < i - 1). (1.10) 



We call &!,...,&„ an LLL-reduced basis of L(5), if 



|/%| < 1/2 (« = 2,...,n; j = l,...,i-l), and (1.11) 

Ik* II 2 < 2||6* +1 || 2 (i = l,...,n-l). (1.12) 

For an integral lattice L, its orthogonal lattice is denned as 

L L = { y G Z n | y T x = Vx G L }, 

and it holds that (see e.g. [20] ) 

detL" 1 < detL. (1.13) 

Suppose A is an integral matrix with independent rows. Then recalling (jl.ip . N(.A) is the same as 
L(j4 ) . A lattice L C Z" is called complete, if 

L = lin L n Z n . 

The following lemma summarizes some basic results in lattice theory that we will use later on; for 
a proof, see for instance |20J. 



Lemma 1. Let V be an integral matrix with n rows, and k independent columns, and L = L(V). 
Then (1) through (3) below are equivalent. 

(1) L is complete; 

(2) det L 1 - = det L; 

(3) There is a unimodular matrix Z s.t. 

( h \ 



ZV 



y ^(n-k)xk / 



Furthermore, if Z is as in part £2J) ; then the last n — k rows of Z are a basis of L . 

a 

For an n-vector a, we will write 

f{a) = 2"/4/|| a ||i/n 

g (a) = 2(™- 2 )/ 4 /||a|| 1 /("-i). l ' > 



2 Main results 



In this section we will review the main results of the paper, give some examples, explanations, and 
some proofs that show their connection. The bulk of the work is the proof of Theorems [31 HI and 
[5l which is done in Section El 

The main purpose of this paper is an analysis of the reformulation methods. This is done 
in Theorem QJ which proves an upper bound on the number of branch-and-bound nodes, when 
branching on the last variable in the reformulations. However, some of the intermediate results 
may be of interest on their own right. 

We will rely on Theorem O proven in the companion paper [22J , which gives a bound on the 
determinant of a sublattice in an LLL-reduced basis, thus generalizing the well-known result from 
[17j showing that the first vector in such a basis is short. 

Theorems [3] and [4] show that an integral vector p, which is "near parallel" to a can be extracted 
from the transformation matrices of the reformulations. The notion of near parallelness that we use 
is stronger than just requiring | sin(a,p)| to be small, and the relationship of the two parallelness 
concepts is clarified in Proposition [TJ A method to find a near parallel vector using simultaneous 
diophantine approximation was described by Frank and Tardos in [8]. Their goal was quite different 
from ours, and a near parallel vector derived via diophantine approximation is not suitable for the 
analysis of the reformulation methods. For completeness, we will give an overview of their method 
in subsection 14.11 

Theorem [5] proves an upper bound on iwidth(p, (IKPj) ). where p is an integral vector. A novelty 
of the bound is that it does not depend on /?i, and /%, only on their difference. We show through 
examples that this bound is quite useful when p is a near parallel vector found according to 
Theorems [3] and EJ 

In the end, a transference result between branching directions in the original, and reformulated 
problems completes the proof of Theorem [TJ 

Theorem 1. Suppose \\a\\> 2( n / 2+1 ) n . Then 



(1) iwidth(e„, (HEED) < L/(«)( 2 IMI +(& - A))J + I- 

(2) iwidth(e n _i, (HE!])) < [2g{a) \\v\\\ +1. 



□ 



The integer width, and the width differ by at most one, and are frequently used interchangeably 
in integer programming algorithms. For instance, the algorithms of |X8|, [T9] find a branching 
direction in which the width is bounded by an exponential function of the dimension. The goal is 
proving polynomial running time in fixed dimension, and this would still be achieved if the width 
were larger by a constant. 



In contrast, when \\a\\ is sufficiently large, Theorem Q] implies that the integer width is at most 
one in both reformulations. 

The following was proven in [22j: 

Theorem 2. Suppose that bi,...,b n form an LLL-reduced basis of the lattice L, and denote by 
Li the lattice generated by b\, . . . ,bg. Then 

detL e < 2 i{n - £ V\detLf n . (2.15) 

Theorem [2] is a natural generalization of ||6i ||< 2( n ~ 1 " 4 (det L) l ' n (see |17j). 

Given a and p integral vectors, we will need the notion of their near parallelness. The obvious 
thing would be to require that | sin(a,p)| is small. Instead, we will write a decomposition 

a = Xp + r, with A £ Q, r £ Q n , r±p, (DECOMP) 

and ask for || r || /A to be small. The following proposition clarifies the connection of the two near 
parallelness concepts, and shows two useful consequences of the latter one. 



Proposition 1. Suppose that a, p £ IT 1 , andr and X are defined to satisfy {3.21 ). Assume w.l.o.g. 
X > 0. Then 

(1) sin(a,p) <\\r\\ /A. 

(2) For any M there is a, p with ||a||> M such that the inequality in {7J) is strict. 

(3) Denote by pi and a% the ith component of p, and a. If \\ r \\ /A < 1, and pi ^ 0, then the 
signs of pi and aj agree. Also, i/||r||/A < 1/2, then [ai/X] = Pi. 

Proof Statement ([I]) follows from 

sin(a,p) = || r || / || a\\ < \\r \\ / \\ Xp\\ < \\r \\ /A, (2-16) 

where in the last inequality we used the integrality of p. 

To see (|2|), one can choose a and p to be near orthogonal, to make \\r \\ /A arbitrarily large, 
while sin(a,p) will always be bounded by 1. A more interesting example is from considering the 
family of a, and p vectors 

a = f m 2 + 1, m? ) , 

(2.17) 
p = m + 1, m] 



with m an integer. Letting A and r be defined as in the statement of the proposition, a straight- 
forward computation (or experimentation) shows that as m — > oo 

sin(a,p) — ► 0, 

|| r || /A -» 1 



Statement (J3j) is straighforward from 

Oi/A = Pi + ri/X. (2.18) 

D 

The next two theorems show how the near parallel vectors can be found from the transformation 
matrices of the reformulations. 

Theorem 3. Suppose \\a\\ > 2^ n ' 2+1 ' n . Let U be a unimodular matrix such that the columns of 



are LLL-reduced, and p the last row of U . Define r and A to satisfy {3. 21 ), and assume w.l.o.g. 
A >0. 

Then 

(1) ||p||(l+||r|| 2 ) 1/2 <IMI.f(a); 

(2) A > l//(o); 

(3) ||r||/A<2/(o). 

D 

Theorem 4. Suppose || a || > 2^ n ' 2+1 ' n . Let V be a matrix whose columns are an LLL-reduced 
basis ofN(a), b an integral column vector with ab = 1, and p the (n — l)st row of (V,b) . Define 
r and A to satisfy fi3.27\) , and assume w.l.o.g. A > 0. 

Then r/0, and 



(1) ||p|||k||<||o|| g(a); 

(2) ||r||/A<2 5 (a). 



a 



It is important to note that p is integral, but A and r may not be. Also, the measure of 
parallelness to a, i.e. the upper bound on || r || /A is quite similar for the p vectors found in 
Theorems [3] and HI but their length can be quite different. When || a \\ is large, the p vector in 
Theorem[3]is guaranteed to be much shorter than a by A > l//(o). On the other hand, the p vector 
from Theorem H] may be much longer than a : the upper bound on || p [| || r || does not guarantee 
any bound on \\p\\, since r can be fractional. 

The following example illustrates this: 



Example 1. Consider the vector 

a = (3488, 451, 1231, 6415, 219l) . (2.19) 

We computed p%, n, X\ according to Theorem 0; 



(2.20) 



Pi = (62, 8, 22, 114, 39J, 

n = H).2582, 0.9688, -6.5858, 2.0554, -2.902l) , 

Ai = 56.2539, 

|| n|| /A! = 0.1342. 

We also computed P2, r 2 , A2 according to Theorem^ note \\p2 ||>||a||: 

p 2 = (l2204, 1578, 4307, 22445, 7666) 

r 2 = (—0.0165, -0.0071, 0.0194, 0.0105, -0.014o) 

A 2 = 0.2858 

||r 2 ||/A 2 = 0.1110. 



(2.21) 



a 



Theorem [5] below gives an upper bound on the number of branch-and-bound nodes when branch- 
ing on a hyperplane in (jKPp . 



Theorem 5. Suppose that a = Xp + r, with p > 0. Then 



iwidth(p, (TJD3)) < 



MIIMI P2-P1 



x 



+ 1. (2.22) 



This bound is quite strong for near parallel vectors computed from Theorems [3] and HI For 
instance, let a, pi, n, Ai be as in Example [TJ If f3\ = P2 in a knapsack problem with weight vector 
a, and each xi is bounded between and 11, then Theorem [5] implies that the integer width is at 
most one. At the other extreme, it also implies that the integer width is at most one, if each Xi is 
bounded between and 1, and /3 2 — Pi < 39. However, this bound does not seem as useful, when 
p is a "simple" vector, say a unit vector. 

We now complete the proof of Theorem[ll based on a simple transference result between branch- 
ing directions, taken from [14| . 

Proof of Theorem [1] 

Let us denote by Q, Q, and Q the feasible sets of the LP-relaxations of (|KP|) . of (|KP-R| . and 
of (|KP-Nj) . respectively. 

10 



First, let U, and p be the transformation matrix, and the near parallel vector from Theorem [3l 
It was shown in p2| that iwidth(p, Q) = iwidth(pU, Q). But pU = ±e n , so 



iwidth(p, Q) 



On the other hand, 



iwidth(p, Q) < 
< 



\r\\\\v 



iwidth(e n ,Q). 

02- A 



+ 



+ 1 



A A 

/(a)(2HI+(/?2-/?i))J + I 



(2.23) 



(2.24) 



with the first inequality coming from Theorem \5\ and the second from using the bounds on 1/A 
and ||r|| /A from Theorem [3j Combining (J2.23P and (|2.24[) yields ([I]) in Theorem [TJ 

Now let V, and p be the transformation matrix, and the near parallel vector from Theorem [U 
It was shown in [14j that iwidth(p, Q) = iwidth(pV, Q) . But pV = ±e n _i, so 



iwidth(e n _i , Q) = iwidth(p, Q). 



On the other hand, 



iwidth(p, Q) < 



A 



+ 1 



(2.25) 



(2.26) 



< l9(a)(2\\v\ 



+ 1. 



with the first inequality coming from Theorem [5J and the second from using the bound on \\r\\ /A 
in Theorem H Combining ([2^5]) and flgJjggD yields © in Theorem HJ 



3 Proofs 



3.1 Near parallel vectors: intuition, and proofs for Theorems [3] and [4] 

Intuition for Theorem [3] We review a proof from p3], which applies when we know a priori the 
existence of a decomposition 

a = p\ + r, (3.27) 

with A large with respect to ||p||, and ||r|| . The reason that the columns of 




are not short and orthogonal is the presence of the XiPi components in the first row. So if postmul- 
tiplying by a unimodular U results in reducedness, it is natural to expect that many components 
of pU will be zero; indeed it follows from the properties of LLL-reduction, that the first n — 1 
components will be zero. Since U has full rank, the nth component of pU must be nonzero. So 
p will be the a multiple of the last row of U~ l , in other words, the last row of U^ 1 will be near 



11 



parallel to a. (In [14J it was assumed that p, r, and A are integral, but the proof would work even 
if A and r were rational. ) 

It is then natural to expect that the last row of C/" 1 will give a near parallel vector to a, even 
if a decomposition like (I3.27J) is not known in advance. This is indeed what we show in Theorem 
[3l when || a || is sufficiently large. 

Proof of Theorem [3] First note that the lower bound on ||a|| implies 

/(a) < \/3/2. (3.28) 

Let Li be the lattice generated by the first I columns of I U, and 

■ - C "-') 

Clearly, Z is unimodular, and 

i a ") - („::)• 

So Lemma Q] implies that Lg is complete, and the last n + 1 — £ rows of Z generate Lj~. The last 
row of Z is (1, —a), and the next-to-last is (0,p), so we get 



(3.30) 



detL n = detL^ = (||a|| 2 +1) 1 / 2 , 
detL n _i = detL^_ 1 =||p|| (1+ ||r || 2 ) 1/2 - 

Theorem [2] implies 

detL n _i < 2( n " 1 )/ 4 (detL n ) 1 - 1 / n . (3.31) 

Substituting into (|3.31|) from (|3.30p gives 

IIpII (i+lkll 2 ) 1/2 < 2( n - 1 )/ 4 (VII«|| 2 +i) 1_1/n 

< 2™/ 4 Hall 1 " 1 /™ (3.32) 

= l|a||/(a), 
with the second inequality coming the lower bound on ||a||. This shows ([TJ. 

Proof of ([2]) From ([1]) we directly obtain 

/>/ \9 II Il9 II II? />/ \9 ll ||9 ll ||9|| ||9 

flaY \\a\r - \\r\r flar \\a\r - \\p\\ \\r\r 



1119 — II 119 

\P\\ \\P\\ 



> 1 (3.33) 

/(a) 2 HI 2 
/(a) 2 II a || 2 ' 

12 



where in the first inequality we used ||p||> 1- Now note 

II ||2^ £/ \2 n ||2 

||p||< f{a) \\a\\ , 

i.e. the the denominator of the first expression in (|3.33p is not larger than the denominator of the 
last expression. So if we replace /(a) 2 by 1 in the numerator of both, the inequality will remain 
valid. The result is 

n n2 n n2 1 

\\ n \\ — r 

" "■■ I, I " > JTv' ( 3 - 34 ) 

\\p\\ j( a ) 

which is the square of the required inequality. 



Proof of © We have 



I Il2 II Il2 II Il2 

?" TIT* 



< 



A 2 - || Xp 
< 



,,2ll Il2 



|a|| 2 



< 



< 



/(aril a I 

Ml 2 -Ik || 2 (3.35) 

f(a?\\a\\* 



| a|| 2 —f(a) 2 || a || 2 



1 - /(a) 2 
< 4/(a) 2 , 

where the first inequality comes from Proposition [H the last from (|3.28p . and the others are 
straightforward. 

D 

Intuition for Theorem |4] We recall a proof from [14], which applies when we know a priori the 
existence of a decomposition like in (|3.27[) with A large with respect to \\p ||, and || r ||, and p not 
a multiple of r. It is shown there that the first n — 2 components of pV will be zero. Denote by Li 
the lattice generated by the first I columns of V. So p is in £^_ 2 > an d it is not a multiple of a, 
but it is near parallel to it. 

So one can expect that an element of L^_ 2 which is distinct from a will be near parallel to a, 
even if a decomposition like (|3.27p is not known in advance. The p described in Theorem [5] will be 
such a vector. 

Proof of Theorem [4] The lower bound on \\a\\ implies 

g(a) < y/l/2. (3.36) 

As noted above, let Lp be the lattice generated by the first £ columns of V. We have 

<y,b)- x V = p-M. (3.37) 

13 



So Lemma \T\ implies that Lp is complete, and the last n — £ rows of (V, b)~ l generate Lj. It is 
elementary to see that the last row of (V, 6)" 1 is a, and by definition the next-to-last row is p, and 
these rows are independent, so r / 0. Also, 

detL n _i = detL^_ x =\\a\\, 

detL n _2 = detL^_ 2 = ||p||||HI • 
Theorem [2] with n — 1 in place of n, and n — 2 in place of £ implies 

detL n _ 2 < 2( n - 2 )/ 4 (detL n _ 1 ) 1 - 1 /(«-i). (3.39) 

Substituting into (|3.39|) from (J3.38P gives 

II v II II r II < 2( n " 2 )/ 4 Holl 1-1 /^-!) 

" - ,, „ , ' " (3-40) 



as required. 

Proof of ([2]) It is enough to note that in proof of ([3|) in Theorem [3] we only used the inequality 
||p|| 2 || r || 2 ^ fi a ) 2 \\ a \\ 2 ■ So the exact same argument works here as well with g(a) instead of f(a), 
and invoking (J3.36P as well. 

□ 



3.2 Branching on a near parallel vector: proof of Theorem [5] 

This proof is somewhat technical, so we state, and prove some intermediate claims, to improve 
readability. Let us fix a, p, (3±, fo, and v. For a row- vector w, and an integer £ we write 

ma.x(w,£) = max{ tra \px < £, < x < v } 
min(ui, £) = min { wx \px > £, < x < v}. 

The dependence on p, on v, and on the sense of the constraint (i.e. <, or > ) is not shown by this 
notation; however, we always use px < £ with "max" , and px > £ with "min" , and p and v are 
fixed. Note that as a is a row-vector, and v a column-vector, av is their inner product, and the 
meaning of pv is similar. 

Claim 1. Suppose that £\ and £2 are integers in {0, . . . ,pv}. Then 

min(a,^ 2 )-max(a,£i) > - \\r\\\\v\\ +X(£ 2 - h). (3.42) 



Proof The decomposition of a shows 

max(o, £\) < nrax(r, 1%) + \£i, and 

I o.4to ) 

min(a,£2) > min(r, £2) + ^£2- 
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So we get the following chain of inequalities, with ensuing explanation: 

min(a, £2) — max(a, £\) > min(r, £2) — max(r, £\) + A(^ 2 — ^1) 

> rx 2 - rxi + \{£ 2 - £1) , g ^ 
= r(x 2 - Xl ) + \(£ 2 - £1) 

> -||r||||u||+A(£ 2 -^i). 

Here X2 and x\ are the solutions that attain the maximum, and the minimum in min(r, £2) and 
max(r, £\), respectively. The last inequality follows from the fact that the ith component of X2 — x\ 
is at most vi in absolute value, and the Cauchy- Schwartz inequality. 

End of proof of Claim [TJ 

Next, let us note 

min(a, k) < max(a, k) for k G {0, . . . ,pv}. (3.45) 

Indeed, (|3.45|) holds, since the feasible sets of the optimization problems defining min(a, k), and 
max(a, k) contain {x\px = k, < x < t> }. 

The nonnegativity of p and of a imply min(a, 0) = 0, and max(a,pe) = av. The proof of the 
following claim is trivial, hence omitted. 

Claim 2. Suppose that £\ and £2 are integers in {0, . . . ,pv} with £\ + 1 < £2, and 

max(a^i) < 0i < (3 2 < min(o,^ 2 ). (3.46) 

Then for all x with f3\ < ax < (32 , < x < v 

h<px< £ 2 (3.47) 

holds. 

We assume for simplicity 

max(a, 0) < f}\ < $2 < min(a,pe); (3.48) 

the cases when this fails to hold are easy to handle separately. Let £\ be the largest, and £2 the 
smallest integer such that 

max(a,4) < Pi < fa < min(a,^ 2 )- (3.49) 

From (I3T43I) £ 2 > £\ + 1 follows, and Claim [2] yields 

iwidth(p, (jKFjl ) < l 2 -l\-\. (3.50) 

By the choices of £\ , and £2 we have 

0i < max(a,£i + 1), and /3 2 > min(a,£ 2 — 1), (3.51) 

15 



hence Claim [T] leads to 

/?2 — /?i > min(a, £2 — 1) — max(a, l\ + 1 



that is 



(3.52) 
> - ||r||||'y||+A(4-4-2), 



£ 2 - h -2 < ^ + M, (3.53) 

A A 



Comparing (j3.50[) and (13.531) yields completes the proof. 



□ 



4 Discussion 



4.1 Connection with diophantine approximation, and other notions of near par- 
allelness 

Given a rational vector b, simultaneous diophantine approximation (see e.g. |17^ 115] ) computes 
an integral vector p, and an integer q, such that q, and \\ b — (l/q)p || are both small. Frank 
and Tardos in [8] has explored the following methodology to compute a vector p that is near 
parallel to an integral vector a. They apply diophantine approximation to (1/ || a ||oo 0,, then set 
^ =11 a II 00 /q, r = a — Xp. Then \\r\\ /A will be small, and if || a || is large, then A will be large. [J]. 

The relevance of Theorems [3] and |4] is not just finding near parallel vectors: it is finding a near 
parallel p, which corresponds to a unit vector in the rangespace- and nullspace reformulations, thus 
leading to the analysis of Theorem [TJ 

Finding an integral vector, which is near parallel to an other integral or rational one has other 
applications as well. In [llj Huyer, and Neumaier studied several notions of near parallelness, 
presented numerical algorithms, and applications to verifying the feasibility of a linear system of 
inequalities. 



4.2 Successive approximation 

Theorems [3] and U] approximate a by a single vector. It is natural to ask: if one row of U , or 
of (V, 6) _1 is a good approximation of a, can we construct a better approximation from 2,3, ... ,k 
rows? 

The answer is yes, and we outline the corresponding results below, and their proofs, which are 
slight modifications of the proofs of Theorems [3] and HI As of now, we don't know how to use the 
general results for a better analysis of the reformulations than what is already given in Theorem [TJ 



1 Thanks are due to Laci Lovasz and Fritz Eisenbrand for pointing out this connection 
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So we mainly state the successive approximation results for the interesting geometric intuition 
they give. Let us define 

f(a,k) = 2( fc ( n " fc ) +1 )/ 4 / ||af/ n 
g(a,k) = 2 fe ("- 1 - fe )/ 4 / ||a|| (fc_1)/n • 

The successive version of Theorem [3] is given below: 

Theorem 6. Let a£Z™ be a row-vector, with ||a||> 2^ n ' 2+1 ' n , U a unimodular matrix such that 
the columns of 

are LLL-reduced, and P k the (integral) submatrix ofU 1 consisting of the last k rows. Furthermore, 
let a{k) be the projection of a onto the subspace spanned by the rows of P k , r = a — a(k), and 

X k : = \\a(k)\\/det(P k P^ 2 . 

Then 

(1) (det(P k P k T )) 1 / 2 (l+ Hrll 2 ) 1 ^ <|| a || /(ajfc ) ; 

(2) X k >l/f(a,k); 

(3) |sin(a,a(fc))| <\\r \\ /\ k < 2/(a, k). 



Proof sketch We will use the notation of Theorem [3j In its proof we simply change (|3.30p (we 
copy the first expression for det L n for easy reference) to 



detL n . k = detL^_ k = (det^Pj)) 1 / 2 ^ ||r f) 1 ^, 



(4.55) 



detL„ = detlA = (||o|| 2 +1) 1/2 , 
d. 

and (pPTj) to 

detL n _ fc < 2 fc ( n - fc )/ 4 (detL n ) 1 " fc / n . (4.56) 

Then substituting into (j4.56[) from (14. 55ft gives 

(det(P fc Pj)) 1 /2(l + || r ||2)l/2 < 2 (fc(n-fc))/4( v /|| a ||2 +1 )l-fe/n 

< 2 ( fc ( n - fe ) +1 )/ 4 / ||a|| fe / n (4.57) 

= \\a\\ f(a,k), 

with the second inequality coming the lower bound on \\a\\. This shows ([T]), and the rest of the 
proof follows verbatim the proof of Theorem O □ 

Theorem |4] also has a successive variant, which is 
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Theorem 7. Suppose \\ a || > 2^ n ' 2+1 ' n . Let V be a matrix whose columns are an LLL-reduced 
basis ofN(a), b an integral column vector with ab = 1, k < n — 1 an integer, and P k the (integral) 
submatrix of {V,b)~ l consisting of the next-to-last k rows. 

Furthermore, let a(k) be the projection of a onto the subspace spanned by the rows of P k , r = 
a — a(k), and 

X k :=\\a(k)\\ / detiPkP^) 1 ' 2 . 

Then r ^ 0, and 

(1) (det(iVf)) 1/2 [M|<||a||s(a,fc); 

(2) |sin(o,o(fe))| <||r|| /A < 2g(a,k). 



Proof sketch We will use the notation of Theorem [H We need to replace (|3.38p with 

debLn ~ 1 = det ^-i=IMI> (4.58) 

detL n _!_ fe = detL^^ = (det^Pj)) 1 ^ || r || . 

Theorem [2] with n — 1 in place of n, and n — 1 — k in place of £ implies 

detL n _i_ fe < 2 fc ( n - 1 - fc )/ 4 (detL n _i) 1 - fc /( n - 1 ). (4.59) 

Plugging the expressions for detL n _i and detL n _i_fc from (I4,58h into (j4.59[) gives 

(det( J P fcJ Pj)) 1 /2 || r || < 2 k(n~l-k)/4 ^ a ^l-k/(n-l) 

= g(a, k) || a ||, 



(4.60) 



proving ([T]). The rest of the proof is an almost verbatim copy of the corresponding proof in Theorem 
H □ 
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